Detecting Prominence in Conversational Speech: Pitch Accent, Givenness and Focus
نویسندگان
چکیده
The variability and reduction that are characteristic of talking in natural interaction make it very difficult to detect prominence in conversational speech. In this paper, we present analytic studies and automatic detection results for pitch accent, as well as on the realization of information structure phenomena like givenness and focus. For pitch accent, our conditional random field model combining acoustic and textual features has an accuracy of 78%, substantially better than chance performance of 58%. For givenness and focus, our analysis demonstrates that even in conversational speech there are measurable differences in acoustic properties and that an automatic detector for these categories can perform significantly above chance. Disciplines Computer Sciences Comments Sridhar, V., Nenkova, A., Narayanan, S. & Jurafsky, D., Detecting Prominence in Conversational Speech: Pitch Accent, Givenness and Focus, 4th Conference on Speech Prosody, 2008 This conference paper is available at ScholarlyCommons: http://repository.upenn.edu/cis_papers/729 Detecting prominence in conversational speech: pitch accent, givenness and focus Vivek Kumar Rangarajan Sridhar, Ani Nenkova, Shrikanth Narayanan, Dan Jurafsky 1 University of Southern California 2 University of Pennsylvania 3 Stanford University [email protected], [email protected], [email protected], [email protected]
منابع مشابه
The perception of phrasal prominence in English, Spanish and French conversational speech
Since Bolinger’s [1] discovery that pitch cues accentual prominence in English, a tension has arisen between two strategies: equating accent with pitch excursions and relying on perception for identifying accented words. This paper investigates the relation between prominence judgments from untrained listeners and accentual labels produced by trained transcribers. Naïve speakers of English, Spa...
متن کاملAgainst a Unified Analysis of Givenness and Focus
The role of information structure in determining the placement of pitch accent in English is often reduced to notions of Focus (e.g. Chomsky, 1971; Vallduvı́, 1990; Rooth, 1992; Roberts, 1996) and Givenness (e.g. Chafe, 1974; Schwarzschild, 1999; Féry & Samek-Lodovici, 2006; Selkirk, 2007). In question-answer pairs like the following,1 the default sentential stress pattern of English, where the ...
متن کاملTowards Hierarchical Prosodic Prominence Generation in TTS Synthesis
We address the problem of identification (from text) and generation of pitch accents in HMM-based English TTS synthesis. We show, through a large scale perceptual test, that a large improvement of the binary discrimination between pitch accented and non-accented words has no effect on the quality of the speech generated by the system. On the other side adding a third accent type that emphatical...
متن کاملAn Automatic System for Detecting Prosodic Prominence in American English Continuous Speech
A precise identification of prosodic phenomena and the construction of tools able to properly manage such phenomena are essential steps to disambiguate the meaning of certain utterances. In particular they are useful for a wide variety of tasks: automatic recognition of spontaneous speech, automatic enhancement of speechgeneration systems, solving ambiguities in natural language interpretation,...
متن کاملPre-focal givenness and accentuation in Estonian
A well-known factor affecting sentence prosody is Information Structure, including the givenness vs. newness of the information conveyed by a constituent. In many languages, givenness is expressed prosodically either by deaccentuation (e.g. [1]) or by a less prominent realisation of the accent, which may be achieved either by phonological means like accent type (e.g. [2]), or by phonetic means ...
متن کامل